Rank in Wordlist | Frequency | Word |
---|---|---|
205 | 1977 | -- |
372 | 1146 | з-пад |
511 | 845 | з-за |
541 | 797 | па-беларуску |
1168 | 401 | Па-першае |
1385 | 347 | Па-другое |
1534 | 316 | па-ранейшаму |
1719 | 285 | па-за |
2113 | 231 | па-расейску |
2415 | 201 | 1990-х |
2596 | 186 | 90-х |
2642 | 183 | бел-чырвона-белы |
3269 | 149 | па-другое |
3338 | 145 | Нью-Ёрку |
3665 | 132 | больш-менш |
3913 | 124 | па-першае |
4208 | 115 | па-іншаму |
4264 | 113 | калі-небудзь |
4550 | 106 | што-небудзь |
4721 | 101 | З-за |
Rank in Wordlist | Frequency | Word |
---|---|---|
205 | 1977 | -- |
2642 | 183 | бел-чырвона-белы |
8530 | 53 | бел-чырвона-белыя |
11552 | 37 | бел-чырвона-белага |
11553 | 37 | бел-чырвона-белымі |
12261 | 34 | 391-22-24 |
12650 | 33 | бел-чырвона-белым |
13576 | 30 | 266-39-52 |
13585 | 30 | Бел-чырвона-белы |
17406 | 22 | бел-чырвона-белых |
Rank in Wordlist | Frequency | Word |
---|---|---|
30640 | 10 | --- |
39206 | 7 | 8-029-391-22 |
48963 | 5 | 8-029-391-22-24 |
90980 | 2 | Н-і-к-о-л-і |
115247 | 2 | та-та-та-та-тА |
119271 | 1 | ---БЛЕФ |
119272 | 1 | ---Зямлю |
119273 | 1 | ---ТАК |
119274 | 1 | ---там |
119411 | 1 | -ф-а-л- |
Rank in Wordlist | Frequency | Word |
---|---|---|
48963 | 5 | 8-029-391-22-24 |
90980 | 2 | Н-і-к-о-л-і |
115247 | 2 | та-та-та-та-тА |
119411 | 1 | -ф-а-л- |
122218 | 1 | 40-х--50-х |
122968 | 1 | 8-803-100-03-ХX |
122969 | 1 | 8-803-100-03-ХХ |
126466 | 1 | r-a-t--k-i-n-g |
137149 | 1 | Гродна-Баранавічы-Менск-Паставы-Браслаў |
137688 | 1 | Д-о-ж-д-ж |
Some languages allow the formation of longer word by composition using hyphens. Moreover, proper names may contain hyphens. Therefore we look for the most frequent words containing 1, 2, 3 or 4 hyphens.
Usually we find interesting words. But in the case of poor preprocessing there may be unexpected strings resulting from hyphenation etc. Words ending with an hyphen are usually not welcome, too.
For three hyphens:
select w_id-100,freq, word from words where word like "%-%-%-%" limit 10;
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots
3.12.4 Words containing special characters